Algorithms for Nash Equilibria in General-Sum Stochastic Games

نویسندگان

H. L. Prasad

Prashanth L. A.

Shalabh Bhatnagar

چکیده

Over the past few decades the quest for algorithms to compute Nash equilibria in general-sum stochastic games has intensified and several important algorithms (cf. [9], [12], [16], [7]) have been proposed. However, they suffer from either lack of generality or are intractable for even medium sized problems or both. In this paper, we first formulate a non-linear optimization problem for stochastic games and then break it down into simpler subproblems that ensure there is no Bellman error for a given state and agent. Next, we derive a set of novel necessary and sufficient conditions for solution points of these sub-problems to be Nash equilibria of the underlying game. Using these conditions, we develop two novel algorithms OFF-SGSP and ON-SGSP,respectively. OFF-SGSP is an off-line centralized algorithm which assumes complete information of the game. On the other hand, ON-SGSP is an online decentralized algorithm that works with simulated transitions of the stochastic game. Both algorithms are guaranteed to converge to Nash equilibrium strategies for general-sum (discounted) stochastic games.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Two-Timescale Algorithms for Learning Nash Equilibria in General-Sum Stochastic Games

We consider the problem of finding stationary Nash equilibria (NE) in a finite discounted general-sum stochastic game. We first generalize a non-linear optimization problem from [9] to a general N player game setting. Next, we break down the optimization problem into simpler sub-problems that ensure there is no Bellman error for a given state and an agent. We then provide a characterization of ...

متن کامل

A Study of Gradient Descent Schemes for General-Sum Stochastic Games

Zero-sum stochastic games are easy to solve as they can be cast as simple Markov decision processes. This is however not the case with general-sum stochastic games. A fairly general optimization problem formulation is available for general-sum stochastic games by Filar and Vrieze [2004]. However, the optimization problem there has a non-linear objective and non-linear constraints with special s...

متن کامل

Stochastic Learning of Equilibria in Games: The Ordinary Differential Equation Method

Our purpose is to discuss stochastic algorithms to learn equilibria in games, and their time of convergence. To do so, we consider a general class of stochastic algorithms that converge weakly (in the sense of weak convergence for stochastic processes) towards solutions of particular ordinary differential equations, corresponding to their mean-field approximations. Tuning parameters in these al...

متن کامل

Fast Planning in Stochastic Games

Stochastic games generalize Markov decision processes (MDPs) to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards determined by multiplayer matrix games at each state. We consider the problem of computing Nash equilibria in stochastic games, the analogue of planning in MDPs. We begin by providing a generalization of nite-horizon v...

متن کامل

On Nash Equilibria in Stochastic Games

We study in nite stochastic games played by n-players on a nite graph with goals given by sets of in nite traces. The games are stochastic (each player simultaneously and independently chooses an action at each round, and the next state is determined by a probability distribution depending on the current state and the chosen actions), innite (the game continues for an in nite number of rounds),...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1401.2086 شماره

صفحات -

تاریخ انتشار 2014

Algorithms for Nash Equilibria in General-Sum Stochastic Games

نویسندگان

چکیده

منابع مشابه

Two-Timescale Algorithms for Learning Nash Equilibria in General-Sum Stochastic Games

A Study of Gradient Descent Schemes for General-Sum Stochastic Games

Stochastic Learning of Equilibria in Games: The Ordinary Differential Equation Method

Fast Planning in Stochastic Games

On Nash Equilibria in Stochastic Games

عنوان ژورنال:

اشتراک گذاری